Log-Euclidean bag of words for human action recognition

نویسندگان

  • Masoud Faraki
  • Maziar Palhang
  • Conrad Sanderson
چکیده

Representing videos by densely extracted local space-time features has recently become a popular approach for analysing actions. In this paper, we tackle the problem of categorising human actions by devising Bag of Words (BoW) models based on covariance matrices of spatio-temporal features, with the features formed from histograms of optical flow. Since covariance matrices form a special type of Riemannian manifold, the space of Symmetric Positive Definite (SPD) matrices, non-Euclidean geometry should be taken into account while discriminating between covariance matrices. To this end, we propose to embed SPD manifolds to Euclidean spaces via a diffeomorphism and extend the BoW approach to its Riemannian version. The proposed BoW approach takes into account the manifold geometry of SPD matrices during the generation of the codebook and histograms. Experiments on challenging human action datasets show that the proposed method obtains notable improvements in discrimination accuracy, in comparison to several state-of-the-art methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Identification of Human Actions in Video Database

The Human activity detection is a more highly sought research as they are used for the Surveillance, Healthcare system, content-based search, and interactive game applications. The Human activity recognition is carried out by three main stages namely object segmentation, feature extraction, their representation and human action detection using different algorithms. This paper is an attempt to i...

متن کامل

Learning Mid-level Words on Riemannian Manifold for Action Recognition

Human action recognition remains a challenging task due to the various sources of video data and large intraclass variations. It thus becomes one of the key issues in recent research to explore effective and robust representation to handle such challenges. In this paper, we propose a novel representation approach by constructing mid-level words in videos and encoding them on Riemannian manifold...

متن کامل

Discriminant Bag of Words based representation for human action recognition

In this paper we propose a novel framework for human action recognition based on Bag of Words (BoWs) action representation, that unifies discriminative codebook generation and discriminant subspace learning. The proposed framework is able to, naturally, incorporate several (linear or non-linear) discrimination criteria for discriminant BoWs-based action representation. An iterative optimization...

متن کامل

Human Action Recognition Based on Boosting

Human action recognition is an active research field in computer vision and image processing. In this paper we propose a novel method for the task of recognition of human actions in video image sequences. First of all, a video sequence is represented as a collection of spatial-temporal words by extracting space-time interest points, which is used to characterize action. Then visual words are us...

متن کامل

Efficient Codebook for Human Activity Recognition in Surveillance Video

M. E. Student, Dept of Computer Science, Annamalai University, India. Associate Professor, Dept of Computer Science, Annamalai University, India. Research Scholar, Dept of Computer Science, Annamalai University, India. [email protected], [email protected], [email protected] ABSTRACT—Automatic human activity recognition methods are useful for many applications such as Video Surveillanc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IET Computer Vision

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2015